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CROSS REFERENCE TO RELATED APPLICATIONS 

(Not Applicable) 

STATEMENT REGARDING FEDERALLY SPONSORED 
RESEARCH OR DEVELOPMENT 

(Not Applicable) 

BACKGROUND OF THE INVENTION 

1. Technical FifilH 

This invention relates to the field of connputer speech recognition and 
more particularly to a method and system for executing voice commands 
having ordinary dictation as a parameter. 

2. Descrintin n of tHr Related Art 

Speech recognition is the process by which an acoustic signal received 
by microphone is converted into a set of words by a computer. These 
recognized words may then be used in a variety of computer software 
applications. For example, speech recognition may be used to input data, 
prepare documents and control the operation of system and application 
software. 

Speech recognition systems can recognize and insert dictated text in a 
variety of software applications. For example, one can use a speech system to 
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dictate a letter into a word processing document. Simply stated, a speech 
recognition engine receives the user's dictated words in the form of speech 
signals, which it processes using known algorithms. The processed signals are 
then "recognized" by identifying a corresponding text phrase in a vocabulary 
database. The text is then conveyed to an active software application, where 
it Is displayed. This type of spoken utterance is considered to be ordinary 
dictation because it Is merely transcribed and does not execute a control 
command. 

As mentioned, the speech recognition system may also be used to 
control the operation of voice-enabled system and application software. 
Typically, the software Is controlled by a user issuing voice commands for 
performing system or application events. There are two broad categories of 
speech recognition systems for executing voice commands: natural language 
understanding (NLU) systems and finite state grammar systems. NLU systems 
permit total linguistic flexibility In command expression by recognizing as 
commands, spoken phrases in terms naturally intuitive to the speaker. For 
example, an NLU system is likely to recognize the spoken utterance, "would 
you be a dear and open the Pensky file for me?", as instructing the system to 
execute a "file open" command for the file named "Pensky". However, NLU 
systems are extremely complex, and at this point, operate only on very 
sophisticated computers. 
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Consequently, the vast majority of comnnercial speech recognition 
systems are finite grammar systems. In a simple finite grammar system, the 
user in the above example would utter a much more structured phrase, such 
as, "open file Pensky". Upon receiving the speech signals corresponding to the 
spoken phrase, the speech recognition engine processes the signals to 
determine whether they correspond to a command coded within one or more 
command sets or grammars. If so, the command is processed and executed by 
the software so as to perform the corresponding event, in this case, opening 
the "Pensky" file. 

The simplest command grammars correlate each command or function 
that the system can perform to one speech command. More advanced finite 
state grammar systems allow for increased linguistic flexibility by including 
alternative commands for performing each function, so that a speaker can utter 
any one of a number of expressions to perform the event. Typically, these 
systems convert spoken phrases into one of a finite set of functional 
expressions using translation rules or by parsing annotation in the grammar. 
These systems, despite having a finite grammar system, enable a user to speak 
more naturally when issuing voice commands. 

As stated, existing speech recognition systems are capable of receiving 
speech signals from a user and either recognizing the signals as ordinary 
dictation or as a voice command for performing an event. However, typical 
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speech systems are unable to recognize voice conrimands that include ordinary 
dictation so as to execute a comnnand having dictation as a paranneter. 

One example of such a voice command is, "send a note to Bill regarding 
today's meeting", which is intended to call up an E-mail application that will 
send a message to a colleague named "Bill" with "today's meeting" displayed in 
the message subject text field. Typical speech systems are likely to interpret 
this statement as ordinary dictation, transcribing the entire spoken phrase as 
text in a document, despite the fact that it includes elements of both a 
command and ordinary dictation. Alternatively, the statement may be 
recognized only as a command to execute the E-mail application, without 
inserting the dictation "today's meeting" in the subject line. 

A basic reason existing speech systems have difficulty with these types 
of mixed voice commands is that the command grammars contain only a finite 
number of command patterns. It is impractical, if not impossible, to code into a 
command grammar the tens of thousands of words or word combinations in a 
given language. Thus, typical systems limit the grammar sets to contain 
phrases indicating functions relevant to performing computer software events. 
These functional phrases comprise a much smaller sub-set of an entire 
language, yet are extremely useful in carrying out software application events. 
Because the vast majority of phrases used in ordinary dictation are left out of 
the command grammars, typical finite speech systems are unable to 
incorporate the dictation portion in commands. 
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Accordingly, there is a need to provide a finite grammar speech 
recognition system able to execute voice commands having ordinary dictation 
as a parameter. 

SUMMARY OF THE INVENTION * 
The present invention provides a method and system to execute voice 
commands, having ordinary dictation as a parameter, for performing system 
and application software events. 

Specifically, in a system adapted for speech recognition, the present 
invention provides a method for executing a voice command in the form of a 
spoken utterance having a dictation portion. The method begins by receiving a 
user input corresponding to the spoken utterance. This input is processed to 
identify a pattern of words forming the spoken utterance which matches a pre- 
determined command pattern. A computer system command is identified that 
corresponds to the pre-determined command pattern and has at least one 
parameter. The one or more parameters are extracted from words contained in 
a dictation portion of the voice command which are distinct from the pattern of 
words matching the command pattern. The computer system command is then 
processed to perform an event in accordance with the one or more command 
parameters. 

Another aspect of the invention is that the words forming the dictation 
portion of the voice command may be embedded within the pattern of words 
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matching the command pattern. The dictation portion of the voice command 
can be comprised of any set of words in a voice recognition engine vocabulary. 
Consequently, the event performed by the system can include inserting the 
dictation portion of the spoken utterance at a location in a word processing 
document or any other location specified by the computer system command. 

Still another aspect of the invention is that the system may identify a 
pattern of words in the spoken utterance to match any one of a plurality of the 
pre-determined command patterns. Each of the plurality of command patterns 
can belong to at least one pre-determined command pattern set. According to 
a preferred embodiment, a command pattern in any of the sets can only be 
matched when the set is in an active state. The command pattern sets are 
placed in an active state according to the operating state of the computer 
system. If no pattern of words forming the spoken utterance matches the pre- 
determined command pattern, the system provides a software application with 
recognized text. 

Another preferred embodiment of the present includes a system for 
executing a voice command in the form of a spoken utterance having a 
dictation portion. Specifically, the system includes programming for receiving a 
user input corresponding to the spoken utterance. The system also includes 
programming for identifying a pattern of words forming the spoken utterance 
which matches a pre-determined command pattern as well as a computer 
system command that corresponds to the pre-determined command pattern and 
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has at least one parameter. Also, the system includes programming for 
extracting the one or more parameters from words contained in a dictation 
portion of the voice command, which are distinct from the pattern of words 
matching the command pattern. The computer system command is then 
processed to perform an event in accordance with the one or more command 
parameters. 

Another aspect of this system is that the words forming the dictation ■ 
portion of the voice command may be embedded within the pattern of words 
matching the command pattern. The dictation portion of the voice command 
can be comprised of any set of words in a voice recognition engine vocabulary. 
Consequently, the system can include programming for inserting the dictation 
portion of the spoken utterance at a location in a word processing document or 
any other location specified by the computer system command. 

Yet another aspect of this system is that it includes programming to 
identify a pattern of words in the spoken utterance to match any one of a 
plurality of the pre-determined command patterns. Each of the plurality of 
command patterns can belong to at least one pre-determined command pattern 
set. According to another preferred embodiment, a command pattern in any of 
the sets can only be matched when the set is in an active state. The command 
pattern sets are placed in an active state according to the operating state of the 
computer system. If no pattern of words forming the spoken utterance 
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matches the pre-determined command pattern, programming is included to 
provide a software application with recognized text. 

Thus, the present invention provides the object and advantage of 
recognizing spoken utterances that include a combination of voice commands 
and ordinary dictation. Once recognized, events can be performed according to 
the dictation portion of the spol<en utterances. 

These and other objects, advantages and aspects of the invention will 
become apparent from the following description. In the description, reference 
is made to the accompanying drawings which form a part hereof, and in which 
there is shown a preferred embodiment of the invention. Such embodiment 
does not necessarily represent the full scope of the invention and reference is 
made therefore, to the claims herein for interpreting the scope of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a computer system for speech recognition with which the 
method and system of the present invention may be used; 

Fig. 2 is a block diagram showing a typical architecture for the computer 
system of Fig. 1 having a speech recognition engine; 

Fig. 3 is a block diagram showing the architecture for a speech 
recognition engine using multiple constraints in the recognition process; and 

Fig. 4 is a flow chart showing the process for executing voice commands 
Incorporating ordinary dictation according to the present invention. 

QBMKE\4383559.3 9 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
Referring to the drawings In detail, wherein like reference characters 
represent corresponding elements throughout the several views, more 
specifically referring to Fig. 1, a computer system with which the present 
invention may be practiced is referred to generally by reference number 10. 
Referring to Figs. 1 & 2, the computer system 10 is preferably comprised of a 
computer 12 having a central processor 14, at least one memory device 16 and 
related electronic circuitry (not shown). The computer system 10 also includes 
user Input devices, a keyboard 18 and a pointing device 20, a microphone 22, 
audio loud speakers 24, and a vicfeo display 26, all of which are operatively 
connected to the computer 12 via suitable interface circuitry. The keyboar'd 
1 8, pointing device 20 and loud speakers 24 may be a part of the computer 
system 10, but are not required for the operation of the Invention. 

Generally, the computer system 10, as described above, can be satisfied 
by any one of many high-speed multi-media personal computers commercially 
available from manufacturers such as International Business Machines 
Corporation, Compaq, Hailed Packard, or Apple Computers. The memory 
device 16 preferably Includes an electronic random access memory module and 
a bulk storage device, such as a. magnetic disk drive. The central processor 14 
may Include any suitable processing chip, such as any of the Pentium family 
microprocessing chips commercially available from Intel Corporation. 
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Referring to Fig. 2, which illustrates a typical architecture for a computer 
system 10 adapted for speech recognition, the system includes an operating 
system 28 and a speech recognition system 30. The speech recognition 
system 30 comprises a speech recognition engine application 32 and a voice 
navigation application 34. A speech text processor application 36 may also be 
included. However, the invention is not limited in this regard and the speech 
recognition engine application 32 can be used with any other application 
program which is to be voice enabled. Also, the speech recognition engine 32, 
voice navigator 34 and text processor 36 are shown in Fig. 2 as separate 
application programs. It should be noted, however, that these applications 
could be implemented as a single, more complex application. 

In a preferred embodiment, the operating system 28 is one of the 
Windows family of operating systems, such as Windows NT, Windows '95 or 
Windows '98, which are available from Microsoft Corporation of Redmond, 
Washington. The present invention is not limited in this regard, however, as it 
may also be used with any other type of computer operating system. 

Referring still to Fig. 2, in general, an analog audio signal containing 
speech commands is received by the microphone 22 and processed within the 
computer 12 by conventional audio circuitry, having an analog to digital 
convertor, which produces a digitized form of the signal, The operating system 
28 transfers the digital command signal to the speech recognition system 30, 
where the command is recognized by the speech recognition engine 32. 
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Fig. 3 illustrates an architecture for a finite grammar speech recognition 
system using multiple constraints during the recognition process. Generally, 
the speech recognition engine 32 receives the digitized speech signal from the 
operating system 28. The signal is subsequently transformed in representation 
block 38 into a useful set of data by sampling the signal at some fixed rate, 
typically every 10-20 msec. The representation block produces a new 
representation of the audio signal which can then be used in subsequent stages 
of the voice recognition process to determine the probability that the waveform 
portion just analyzed corresponds to a particular phonetic event. This process 
is intended to emphasize perceptually important speaker independent features 
of the speech signals received from the operating system. In classification 
block 40, the processed speech signal is used to identify a subset of probable 
phrases corresponding to the speech signal. This subset of probable phrases is 
searched at block 42 to obtain the recognized phrase. 

Referring still to Fig. 3, classification block 40 is preferably performed by 
acoustic modeling block 44, context modeling block 46 and lexical/grammatical 
modeling block 48. At acoustic modeling block 44, known algorithms process 
the speech command signal to adapt speaker-independent acoustic models, 
contained in memory 16, to the acoustic signal of the current speaker and 
identify one or more probable matching phrases. 

At block 46, additional algorithms may be used to process the speech 
signal according to the current state of the computer system as well as context 
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events, including prior connnnands, systenn control activities, timed activities, 
and application activation, occurring prior to or contemporaneously with the 
spoken command. Specifically, these data structures include activities such as: 
user inputs by voice, mouse, stylus or keyboard; operation of drop-down 
menus or buttons; the activation of applications or applets within an 
application; prior commands; and idle events, i.e., when no activity is logged in 
an event queue for a prescribed time period. The system states and events can 
be statistically analyzed, using statistical modeling techniques, to identify one 
or more probable commands matching the context in which the command was 
given. 

At block 48, algorithms conform the digitized speech signal to lexical and 
grammatical models. These models are used to help restrict the number of 
possible words corresponding to a speech signal according to a word's use in 
relation to neighboring words. The lexical model may be simply a vocabulary 
of words understood by the system. The grammatical model, sometimes 
referred to as a language model, may be specified simply as a finite state 
network, where the permissible words following each word are explicitly listed, 
but is preferably a more sophisticated finite grammar having a plurality of 
grammar sets containing multiple command patterns, as described below. 

In its preferred embodiment, the present invention includes all three of 
the above-described modeling techniques. However, the invention is not 
limited in this regard and can be performed using alternative modeling 
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techniques. For example, it may be practiced without the event-based context 
modeling step. Also, each modeling technique may be performed 
independently from, or interdependently with, the other models. 

Referring now to Fig. 4, at step 50, the system receives a user input in 
the form of a spoken utterance. Spoken utterances can be issued as ordinary 
dictation, voice commands or voice commands incorporating dictation. 
Ordinary dictation is a spoken utterance which does not contain a pattern of 
words recognizable by the system for controlling the operation of system or 
application software. Instead, dictation is spoken merely to have the system 
convert the spoken words into text within an electronic document. Typically, a 
user issues ordinary dictation when preparing a letter or inputting data within 
an application text field. On the other hand, a voice command is a spoken 
utterance which causes the system to perform a pre-determined function within 
system or application software other than simply transcribing text, such as 
opening a file, deleting text in a document or repositioning an active "window". 
A voice command incorporating dictation is a combination of these two 
utterances, having words comprising a dictation portion embedded within a 
pattern of words comprising a command. 

Although the system of the present invention can be used to recognize 
all three types of spoken utterances, the invention is intended to address the 
unique difficulties in recognizing voice commands incorporating dictation. . 
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Accordingly, the following discussion will focus on the voice commands mixed 
with dictation. 

Typical speech systems are likely to interpret these mixed spoken 
utterances as ordinary dictation, transcribing the entire spoken phrase as text in 
a document, or the dictation may be ignored. As mentioned above, the primary 
reason existing speech systems have difficulty with these types of mixed voice 
commands is that the command grammars are, by necessity, coded with a 
limited number of command patterns. Because the vast majority of phrases 
used in ordinary dictation are left out of the command grammars, typical finite 
speech systems are unable to incorporate the dictation portion within the 
command. 

Referring still to Fig. 4, the process advances to step 52, wherein the 
system recognizes whether the spoken utterance contains a recognizable 
pattern of words by comparing the recognized words against a plurality of pre- 
determined pommand patterns of words contained in one or more active 
command pattern sets or grammars. Individual command pattern sets are 
placed in an active state depending upon the state in which the computer 
systenh is operating when the voice command is issued. 

An exemplary mixed voice command is, "schedule a meeting on 
Thursday regarding next quarter's sales plan". This utterance is issued by a 
user to initiate a voice-enabled scheduling application and insert the text "next 
quarter's sales plan" in a meeting text field at a calendar location for Thursday. 
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The utterance is comprised of tine command portion "schedule a meeting on 
Thursday regarding" and the dictation portion "next quarter's sales plan". 

The dictation may be comprised of any set of words in a voice 
recognition vocabulary, which could consist of tens of thousands of words. 
The pattern of words forming the command, on the other hand, must conform 
to the limited command patterns coded into one or more of the active 
command grammars. Thus, the system searches the active command 
grammars for a command pattern corresponding to the recognized speech 
signals. 

For the above example, a corresponding command grammar can be 
coded to include separate scheduling commands for each day of the week. 
Preferably, however, a primary command pattern is coded into the grammar 
having a "day" variable marker indicating that the day of the week may any one 
of the days coded as a sub-command pattern in the same or a different 
grammar. In this way, the system can schedule a meeting for any day of the 
week with only one primary pattern coded into the grammar. This technique is 
not limited to the days of the week, and can be employed for any other 
category of terms such as months, colors, names, telephone numbers, etc. 

At step 54, if the system is unable to match the words of the spoken 
utterance with a corresponding pre-determined command pattern, the process 
advances to step 56, at which point the entire phrase is deemed to be 
dictation. The system then identifies one or more text phrases in a vocabulary 
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set corresponding to the entire spoken utterance and inserts the text in an 
active! software application. 

Otherwise, at step 58, if a matching command pattern is identified in the 
preceding step, the system identifies a corresponding computer system 
command expression. A computer system command is a functional expression 
used by the system or application software for performing an event. The 
speech recognition engine coordinates the grammar with a suitable scripting 
program to cast the computer system command in a form recognizable to the 
desired speech-enabled system or application software for performing the 
desired event. The command expression includes one or more parameters 
corresponding to the voice command. At step 60, at least one of these 
parameters is extracted from the dictation portion of the voice command. The 
entire dictation portion may constitute a command expression parameter, or it 
may be broken down into sub-portions used as separate parameters. 

It will be appreciated by those skilled in the art that the precise 
mechanism for identifying and scripting the command expression can vary from 
system to system. One approach involves the use of translation or rewrite 
rules. These rules are coded within the command grammars to generate the 
command expression. For instance, a translation rule for the above example is 
"schedule a meeting for <day> regarding <text> ^ 

schedulemeeting(<day>,<text>)". This rule includes two variable parameters 
<day> and <text>. Appropriate day parameters are provided as a sub- 
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pattern coded within the active command grammar. The day of the weel< 
spoken by the user is matched against this set of sub-patterns and used to 
identify the intended day of the week in the calendar. The text parameter is 
extracted from the entire dictation portion "next quarters sales plan". Thus, 
applying the rule to the spoken utterance of the example, the scripting program 
generates the computer system command: "schedulemeeting(Thursday, next 
quarter sales plan)". 

This is one example of a translation rule and it will be recognized by 
those skilled in the art that many different translation rules are possible for use 
with different commands and formats, all such rules being within the scope of 
the invention. For systems that do not support translation rules, different 
methods of producing the command expression of the command would be 
required. For example, a parsing program can be used for this purpose to parse 
annotation in the grammar set. it will be appreciated by those skilled in the art 
that any suitable procedure can be used to accomplish the foregoing result 
provided that it is capable of taking a given phrase and determining its 
functional command expression or result. 

Referring still to Fig. 4, at step 62, the command expression is sent to 
the active application to perform the event. In particular, for the above 
example, the application opens a scheduling program and inserts "next 
quarter's sales plan" in a suitable meeting text field for Thursday. The process 
then returns to step 50 to receive additional user input. 
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The above example illustrates that the present invention can be used to 
insert or "paste" the dictation pprtion of the spoken voice command into a 
system or application program. However, the present invention is not limited in 
this regard as the dictation portion may be incorporated into a computer system 
command to perform any number of functions or events. For example, the user 
nnay Issue the command "load alffiies regarding first quarter results". In this 
example, the system will recognize "load all files regarding" as a pattern of 
words matching a pre-determined grammar command. A translation rule such 
as "load all files regarding <text>" o loadfiles(<text>)", having a single 
parameter comprising the dictation portion "first quarter results" is applied to 
create the computer system command "loadfiles(first quarter results)". This 
command is then used, for example, by a word processing application to 
search the file names of all stored documents for the text "first quarter results", 
or the closest match, and then to open up the coi-responding files. i 

While the foregoing specification illustrates and describes the preferred 
embodiments of the invention, it is to be understood that the invention is not 
limited to the precise construction herein disclosed. The invention can be 
embodied in other specific forms without departing from the spirit, or essential 
attributes of the invention. Accordingly, reference should be made to the 
following claims, rather than to the foregoing specification, as indicating the 
scope of the invention. 
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Claims 

We claim: 

1 1. In a computer system adapted for speech recognition, a method 

2 for executing a voice command in the form of a spoken utterance, comprising 

3 the steps of: 

4 receiving a user input corresponding to said spoken utterance; 

5 processing said user input to identify a pattern of words forming said 

6 spoken utterance which match a pre-determined command pattern; 

7 identifying a computer system command corresponding to said pre- 

8l determined command pattern, said computer system command having at least 

9^ one parameter; 

if; extracting said at least one parameter from a dictation portion of said 

IT voice command exclusive of said pattern of words; and 
T2| processing said computer system command to perform an event in 

13j accordance with said at least one command parameter. 

1 2. The method according to claim 1 wherein at least one word 

2 forming said dictation portion of said voice command is embedded within said 

3 pattern of words matching said command pattern. 



1 



2 



3. The method according to claim 1 wherein said step of identifying 
said computer system command is performed by using a translation rule. 
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1 4, The method of claim 1 wherein said dictation portion of said voice 

2 command is comprised of any set of words in a voice recognition engine 

3 vocabulary. 

1 5. The method of claim 4 wherein said event includes inserting said 

2 dictation portion at a specified location defined by said computer system. 

1 6. The method of claim 1 wherein a plurality of said pre-determined 

2 command patterns are provided. 

'•' \ 

fc 7. The method of claim 5 wherein each of said plurality of command 

2-^ patterns belongs to at least one pre-determined command pattern set. 



t\ 8. The method of claim 6 wherein a command pattern in any of said 

2j sets can only be matched'when said set is in an active state. 

1 9. The method of claim 8 wherein said set is placed in an active state 

2 when said computer system is in a pre-defined computer system operating 

3 state. 
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1 10. The method of claim 1 further comprising the step of: 

2 providing recognized text to a software application if no pattern of 

3 words forming said spol<en utterance matches said pre-determined command 

4 pattern. 

1 1 1 . A computer speech recognition system for executing a voice 

2 command in the form of a spoken utterance, comprising: 

3 receiver means for receiving a user input corresponding to said spoken 

4 utterance; 

jSj processor means for processing said user input to identify a pattern of 

$; words forming said spoken utterance which match a pre-determined command 

f: pattern; 

W identification means for identifying a computer system command 

9i corresponding to said pre-determined command pattern, said computer system 

1©i command having at least one parameter; 

1¥ means for ^extracting said at least one parameter from a dictation portion 

12 of said voice command exclusive of said pattern of words; 

13 wherein said processor means processes said computer system 

14 command to perform an event in accordance with said at least one command 

15 parameter. 
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12. The system of claim 1 1 wherein at least one word forming said 
dictation portion of said voice command is embedded within said pattern of 
words matching said command pattern. 

13. The system of claim 1 1 wherein said identification means uses a 
translation rule to identify said computer system command. 

14. The system of claim 1 1 wherein said dictation portion of said 
voice command is comprised of any set of words in a voice recognition engine 
vocabulary. 

15. The system of claim 14 further comprising an insertion means, 
wherein said event includes said insertion means inserting said dictation portion 
of said voice command at a specified location defined by said computer 
system. 

1 6. The system of claim 1 1 wherein a plurality of said pre-determined 
command patterns are provided. 

17. The system of claim 1 6 wherein each of said plurality of command 
patterns belongs to at least one pre-determined command pattern set. 
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1 18. The system of claim 17 wherein a command pattern in any of said 

2 sets can only be matched when said set is in an active state. 

1 19. The system of claim 18 wherein said set is placed in an active state 

2 when said computer system is in a pre-defined system operating state. 

1 20. The system of claim 1 1 wherein said processor means provides 

2 recognized text to a software application if no pattern of words forming said 

3 spoken utterance matches said pre-determined command pattern. 



^ 21 . A machine readable storage, having stored thereon a computer 

program having a plurality of code sections executable by a machine for 

^ causing the machine to perform the steps of: 

M receiving a user input corresponding to a voice command in the form of a 

S| spoken utterance; 

# processing said user input to identify a pattern of words forming said 

7 spoken utterance which match a pre-determined command pattern; 

8 identifying a computer system command corresponding to said pre- 

9 determined command pattern, said computer system command having at least 

10 one parameter; 

1 1 extracting said at least one parameter from a dictation portion of said 

12 voice command exclusive of said pattern of words; and 
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processing said computer system command to perform an event in 
accordance witii said at least one command parameter. 
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METHOD AND APPARATUS FOR 
EXECUTING VOICE COMMANDS 
HAVING DICTATION AS A PARAMETER 

ABSTRACT OF THE DISCLOSURE 

In a computer speech recognition system, tiie present invention provides 
a method and system for recognizing and executing a voice command that has 
a dictation portion. Upon receiving a user input, the spol<en utterance Is 
processed to identify a pattern of words which matches a pre-determined 
command pattern. Then, computer system command is identified that 
corresponds to the pre-determlned command pattern and has at least one 
parameter. The parameter is extracted from a dictation portion of the spoken 
utterance which is separate from the pattern of words matching the command 
pattern. The computer system command is then processed to perform an 
event in accordance with the parameter. If the spoken utterance does not 
contain a pattern of words matching a pre-determined command pattern, then 
the spoken utterance is recognized as dictation and inserted at a specified 
location Into an electronic document or other system or application software. 
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PATTOT 
DOCKET MUMBCT; fiTKq-I^S 
M«at 

As a l»fow named inventof, I hereby <J«cIafe that: 

My r«s<denctt, post office 3d(lr*$s and citizenship are as stated below n&xt to my nam«, 

I beiteve \ am the orljinai, fk« and »oIe inventor 0^ only on» name is iisted betow} or w orroinai, first 
and joint inventor (tf plural rames are listed betow) ot tne subject matter wWch is claimed arid for which a patent 
U sought on the »T>vcntion entitled 

METHOD AND APPARATUS FOR £Xe:UTtNQ V01C£ 
COMMANDS HAVINO PKTTATION AS A PARAMlH'En 

the apedficatton of which (ch«clc cna) 

„x_ is attached hereto. 



was filed on 

und«r Attorney's Docket Number . 
as Application Serial No, 



and was amcr>ded on Gf appiicabie). 

I hereby state that I have reviewed and undamaod the contents of the above identified specification, including 
the claims, as amended by any amendment wierrtil to at)ove- 

I acknowledge the duty to disclose information which is material to the examination of this application in 
accc^dance with Title 37, Code of Federal Re<raiations Saction l.&6(d). 

i hereby daim foreign priority benefits under T?t)« 35^ United States Code 11 9 of any foreign application's) tor 
patent Of inventor*s certificate listed below and have also identified bdow any foreign application for patent oc 
inventor's cenificate having a filinfl date before that of the application on which priority is claimed: 

Prior Foreign Appiication{s) Priority Oaimed 

Yes No 

(Numbcrl (Country) (Filing Oatel 

I hereby daim the berwf it under Trde 36, Unhcd States Cod#, Section 1 20 of any United States appiication{$) 
Ustfid beiow ar>d, insofar as tha subject matter of each of the claims of this application is not discJosed in the 
prior United States application in the manner provided by the first paragraph of Trdc 35. Urwted States Code, 
Section 1 1 2 J acic/wwledge the duty to di$ctose material information as dftir>ed in Tjtle 37, Code of federal 
Regulat*on$, Section 1 .56(a) which occurred between the fifing date of the prior appftcatior^ and the rmional or 
POT intdnndtional filing date of this application: 



(Appln. Serial No,) (Bfing Date) (Status) 

I hereby declare that all statements made herein of my own knowlcdfle arc true and that ail statements made 
on information and belief are believed to be true; and further that these statements were made with tha 
knowledge that willful false statements and the Tike so made are punishable by fine or tmprtsorKnent or both, 
under Section 1001 of Trtle 18 of the Ureted Scatits Code and that such willful falsa statenr^ents may jeopardire 
the validity of the application or any patent issued thereon. 



Q0WP»U 42959 



Page 1 of 2 



Express Mail 

Label No. EE44432Q653US 



JL^ 24 1399 12:20 PR ajfiRLES «ND BRRDY LLP5S1 SS3 5333 TO 9S1^ p. 

CXXXET NUMBER: 6169-125 

POWER OF ATTORNEY; As 3 named invamor, i hereby appoint the foiTowbo attomeys and/or agentt to 
prov«cut« This appticatioft ar^d vansaci ail busincii in the Patftnt and Trademaric Office conrw^tftd ttwewjth; 

J, Modman Steele, Jr. Reft- No, 26,931 Richard A. Tomlin Reg. No. 2^,449 

Gregory A. Nelson Reg. No. 30^77 Winf ield J. Brown, Jr, R«o- No. 3 1 1 

Joseph W, Bain Begf, No. 34,290 Kenneth A. Seaman Reg. No. 28. TI 3 

Robert J. Sacco R«o- No. 35,B67 A, P. Tenr^ R«o. No. 35,807 

Scott 0. Paul Reg. No. 42.984 Norman L. Gundel Reg, No, 3C^e7 

Stsnley A. Ktm Reg, No, 42,730 

Mark D. Passler Reg. No. 40,764 

Send correspondence to Gregory A, Nelson, Quarles & Brady LLP, 222 LaJcevirw Avenue. Suite 400. P.O. 
Box 31 88, Wwt Palm Beach, Florida 33^02-3188 and direct an t*lephorw callc to Gregory A. Ndwn at 
(561) 653-5000. 



FULL fOAME OF FIRST OR SOLE INVENTOR; Thomaa A. iCiST 

INVENTOR'S SIGNATURE: c:/J^^<rw.<^ A '^^-'-^^ DAHE: ^/ 2- ^ / ^ ^ 

RESJDENCE: Soynton Beach, Florkla 
CmZENSHIP: USA 

POST OFFICE ADDRESS: 206 Monterey Square. Boynton Beach, Ftor ida 33436 



FULL NAME OF SECOND fNVENTOR, iF ANY: Bum L. LEWIS 

INVENTOR'S SiGNATUR£: DATE:. 



REBIOENCE: Ossinif>o, New Yoric 
CmZENSHlP: USA 

POST OFFICE ADDRESS: 275 Qvadeayne Road. Otsining. New York 10562 
FULL NAME OF THIRD INVENTOR, iF ANY: Bruce 0. LUCAS 

INVENTOR'S SIGNATURE: DATE:. 

RESlOENCe Yorlctown Heights, New York 
aTl2ENSHIP: USA 

POST OFFICE ADDRESS: 2408 MjH Pond Road, Yorktown Heights, New York 1 0598 
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DECUVRATiON AND POWER OF ATTORNEY FOR 
PATENT APPLICATION 



As t below named inventor. 1 hereby declare that: 

My residence, poet office address and citizenship are as stated beJo^v next to my name, 

I believe r am th% original, first and sole inventor tif only one name ia listed below) or en originai, first 
and ioint inventor (if plural names an listed belowl of the subject matter wNch is claimed and for which a patent 
is sought on the invernion entitled 

METHOD AND APPARATUS FOR EXECUTING VO»CE 
COMMANDS HAVING DICTATION AS A PARAMCTER 

the apecif ication of which (check onel 

n is anached hereto- 

_ was filed on - 



under Attorney's Docket Number , 

as Application Serial No. 

and was amended on 



. (if applicable). 



1 hereby state that I have reviewed ertd understand the contents of the above kJentified specification, including 
the claims, as amended by any amendment referred to above. 

t acknowledge the duty to disclose information which (s material to the examination of this application in 
accordance wHH Title 37, Code of Federal Regulatiorrs Sec^'on 1 .56(a). 

I hereby ctaim foreign priority benefits under Title 35, United States Code 11 9 of any foreign appiication(s) for 
patent or inventor's certificate listed below and have also Identified below any foreign application for patent or 
inventor's certificate having a filing date before that of the application^ on which pnority is claimed: 



Prior Foreign Apprtcation(sJ 



(Number) 



Priority Claimed 



(Country) 



Yes 



„No 



{Fiting Date) 



I hereby claim the benefit under Title 3S, United States Code. Section 1 20 of any United States application(s) 
listed below and, insofar as the subject matter of each of the claims of this application is not disclosed in the 
prior United States appticacion in the msnner provided by the first paragraph of Title 35, United States Code, 
Section 112J acknowledge the duty to disclose material Information as defined in Title 37, Code oF Federal 
Reoulatk>ns, Section 1.56ia) which occurred between the filing date of the prk>r application end the national or 
PCT international fifing date of this applicetton: 



(Appln. Serial No.) 



(Filing Date) 



(Status) 



I hereby declare that ait statements made herein of my own knowledge are true and that ail statements made 
on information and belief are believed to be true; and further that these statenvents were made with the 
knowledge that willful false statements and the tike so made are puntshabia by fine or imprisonment, or both, 
under Section 1001 of Titfe IS of the United States Code and that such willful false statements may jeopardise 
the validity of the application or any patent Issued thereon. 



QSWPBM 42959 



Page 1 of 2 



TO'd 



I79t7t7 St^e VIS 



Express Mail 

Label Uo. EE444320653US 

II dno^3 qoQsds lAiai dfS^SO 66-tO-Lnr 



. JUN 24 .99. P, ,,,,, ^^^^^ ^^^^ S,9..S.S.4s! P. 33.35' 

OOCKCT rfUMBgft : 6 7 65-125 

POWER OF ATTOnNEV: As i nam«d inventor, t hereby appoint the foilowinfl attorneys 9nd/or agents to 
prosecute this application and transact all business in the Patent and Traden^arfc Offic« connected Therewith: 



J. Hodman Steele, Jr. Aea- No. 25,d31 

GreQory A. Neteon Reg. No. 30,577 

Joseph W. Bain Reg. No. 34^290 

Robert J. Sacco Re«. No. 3S,667 

Scott D. Fau( Reg* No. 42,984 

Stanley A. Ktm Reg. No. 42,730 

Merle O. Passler Reg. No. 40,764 



Richerd A, Tomlin 
Winfield J, Brown. Jr, 
Kenneth A, Seaman 
A. P, Tennent 
Norman L. Gundel 



Reg. No. 24,443 
Reg. No. 31,901 
Reg. No. 28Jt3 
Reg. No. 35.807 
Reg. No, 30,387 



fiend correspondence to Gregory A. Nelson, Quarles & Grady LLP, 222 Lakeview Avenue, SuHe 400^ P.O. 
Ecit 31 SB. West Palm Beach, Florida 33402-31 S8 and direct aU telephone catis to Gregory A, Nelson at 
1561} 653-5000. 



FULL Hme OF FIRST OR SOLE WVENTOR: Thoma* A. KlST 
INVENTOR'S S^GNATUR«: 



DATE:. 



REStDENCE; Boynton Beach, Florida 
CITIZENSHIP: USA 

POST OFFICE AODR£SS: 206 Monterey Square. Soynton Beach, norida 33436 



FUU NAME OF S£CONO INVENTOR, W 
INVENTOR'S SIGNATURE: 




RESIDENCE: Ossining, New York 
CmZENSHIP: USA 

POSTCfFiCe AODRESS.- 275 Chedcayne Roed, Oa*ning, New York 10562 



VENTOR. IF ANY: Bruce ^^^^^^ 



FULL NAME OF THIRD INVENTOR, IF ANY: Bruce Dx 
INVENTOR'S SIGNATURE: , 
RESIDENCE; Yorktown Heights, New York 
CmZENSHIP: USA 

POST OFHCE ADDRESS: 2408 Mill Pond Road, Yorktown Heights, New York 10S98 
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